
Genetic Alterations of Chromosome 8 
Genes in Oral Cancer 

Zachary Wei Ern Yong' 2 , Zuraiza Mohamad Zaini 1,2 , Thomas George Kallarakkal' 2 , LeePeng Karen-Ng 2 , 

Zainal Ariff Abdul Rahman 1,2 , Siti Mazlipah Ismail 1,2 , Noor Akmal Sharifah 3 , 

Wan Mahadzir Wan Mustafa 4 , Mannil Thomas Abraham 5 , Keng Kiong Tay 6 & Rosnah Binti Zain' 2 

1 Department of Oro-Maxillofacial Surgical and Medical Sciences, Faculty of Dentistry, University of Malaya, Kuala Lumpur, 
Malaysia, 2 Oral Cancer Research & Coordinating Centre (OCRCC], Faculty of Dentistry, University of Malaya, Kuala Lumpur, 
Malaysia, 3 Department of Pathology, Universiti Kebangsaan Malaysia Medical Centre, Kuala Lumpur, 4 Department of Oral 
Surgery, Hospital Kuala Lumpur, 50588 Kuala Lumpur, Malaysia, 5 Department of Oral and Maxillofacial Surgery, Hospital Tengku 
Ampuan Rahimah, Klang, Malaysia, 6 Oral Surgery Clinic, Hospital Umum Sarawak, Kuching, Malaysia. 

The clinical relevance of DNA copy number alterations in chromosome 8 were investigated in oral cancers. 
The copy numbers of 30 selected genes in 33 OSCC patients were detected using the multiplex 
ligation-dependent probe amplification (MLPA) technique. Amplifications of the EIF3E gene were found in 
27.3% of the patients, MYC in 18.2%, RECQL4 in 15.2% and MYBL1 in 12.1% of patients. The most frequent 
gene losses found were the GATA4 gene (24.2%), FGFR1 gene (24.2%), MSRA (21.2) and CSGALNACT1 
(12.1%). The co-amplification of EIF3E and RECQL4 was found in 9% of patients and showed significant 
association with alcohol drinkers. There was a significant association between the amplification of EIF3E 
gene with non-betel quid chewers and the negative lymph node status. EIF3E amplifications did not show 
prognostic significance on survival. Our results suggest that EIF3E may have a role in the carcinogenesis of 
OSCC in non-betel quid chewers. 
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n 2008, there are about 263,900 new cases of oral cancer and 128,000 deaths reported worldwide 1 . Oral cancer 
incidence rates vary extensively across the world mainly due to the types of oral cancer-associated lifestyles 
practiced by different groups of people 2 . The lifestyle behaviours associated with oral cancers include tobacco 
smoking, alcohol drinking and betel quid chewing 3 . 

The understanding of the genetic basis of oral squamous cell carcinoma (OSCC) carcinogenesis has progressed 
significantly in recent years. It is generally known that OSCC arises through a multistep process in which the 
accumulation of genetic alterations in proto-oncogenes and tumour-suppressor genes sets the foundation for 
carcinogenesis 4 . Moreover, a progression model for oral cancer has been proposed where mucosal fields with 
these genetic alterations can replace the normal epithelium in the oral cavity. Clonal evolution of the mucosal 
fields leads to invasive carcinoma 5,6 . 

Chromosomal alterations in chromosome 8 have been frequently discovered in oral cancers 7 " 9 . Gains in 
chromosome 8q and losses in 8p are the common alterations found 710 and have been shown to be involved in 
the lymph node metastasis of OSCCs 7,8 . Located in chromosome 8 is the well-known gene, c-MYC (8q24), which 
is frequently amplified in various cancers 11 " 13 . The common deleted regions found in oral cancers are 8p21, 8p22 
and 8p23 1415 . At 8p22, the FEZ1 (also known as LZTS1) gene has been shown to be frequently deleted in oral 
cancers 16 . Although many genetic aberrations have been found, the clinical relevance of many of these genes has 
not been characterized in OSCCs. 

To get a better understanding of these genetic alterations in chromosome 8, we used a high resolution PCR- 
based method called the multiplex ligation-dependent probe amplification (MLPA) as it is able to measure 
chromosomal alterations of up to 40 target locations using only small amounts of DNA 17 . MLPA allows the 
concurrent analysis of a large set of oncogenes and tumour suppressor genes which are found in chromosomal 
regions that have shown copy number changes. 

In this study, we aim to examine the genetic alterations of various genes in chromosome 8 using MLPA and find 
the associations between gene copy number changes in oral cancer with sociodemographic and clinicopatholo- 
gical parameters. 
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Figure 1 | Heat map of copy number ratios of 30 genes on chromosome 8 for 33 OSCC patients. Greener squares indicate higher copy number 
ratio while redder squares indicated lower copy number ratios. Yellow squares indicate normal copy number ratios. The dark line shows the border 
between the short arm and long arm of chromosome 8. Each column represents one OSCC patient. 



Results 

Analysis of genetic alterations using MLPA. An overview of the 
copy number ratios of all 33 OSCC patients is presented in a heatmap 
(Fig. 1). The percentage of gene alterations for all 30 genes is shown in 
Figure 2. The most frequent losses were found in the 8p region while 
most of the frequent gains were found in the 8q. Eighteen patients 
(54.5%) showed at least a loss or a gain of a gene. Twelve patients out 
of these 18 patients (66.7%) had both gains and losses, five (27.8%) 
patients showed only gains and 1 patient (5.6%) had only gene losses. 
Fifteen patients (45.5%) did not show any gene alterations. 

The genes with the most frequent gains were EIF3E in 27.3% of the 
patients, MYC (18.2%), RECQL4 (15.2%) andMYBLl (12.1%). Two 
out 33 (6.1%) patients had high level amplifications (copy number > 
2) of MYC. The rest of the genes with high level amplifications were 
ST7, EIF3E, EIF3H, EXT1 and RNF139 which were found in one of 
the patients with high level amplification of MYC. 

The most frequent gene losses were GATA4 gene (24.2% of 
patients), FGFR1 gene (24.2%), MSRA (21.2) and CSGALNACT1 
(12.1%). 

The most frequent co-amplifications were EIF3E and MYC in 1 5% 
of patients, EIF3E and MYBL1 in 12% of patients, and EIF3E and 
RECQL4 in 9% of patients. All MYBL1 amplified (n = 4) OSCCs 
were co-amplified with EIF3E. Of 6 MYC amplifications, 83% (n = 
5) were EIF3E co-amplified. Out of 5 RECQL4 amplifications, 3 were 
co-amplified with EIF3E. 

Associations with sociodemographic parameters. There was a 
significant association (p = 0.047) between the gain of EIF3E gene 
and non-betel quid chewers. The co-amplification of EIF3E and 
RECQL4 showed significant association (p = 0.022) with alcohol 
drinkers. A detailed description of this co-amplification is shown 



in Table 1. No associations were found for sex, ethnicity and 
tobacco smoking habit. 

Associations with clinicopathological parameters. The gain of 
EIF3E showed significant association with negative lymph node 
status (p = 0.01). MYC gains showed a trend towards association 
with negative lymph node status (p = 0.066). The detailed analysis of 
EIF3E and MYC are shown in Table 2 and 3 respectively. There were 
no associations found for primary tumour size, tumour site, tumour 
stage, and histological grading. 

Follow-up data was available for 29 patients, with a median follow- 
up of 16.26 months (range, 0.69-63.36 months). In our data set, 
positive lymph node status (p = 0.014) and moderately-differen- 
tiated tumours (p = 0.028) were significantly associated with poor 
overall survival. Tumour stage III and IV showed a considerable 
trend (p = 0.062) toward unfavourable overall survival. Survival 
was not associated to primary tumour size and tumour site. None 
of the genes in our study were significantly associated with overall 
survival. KCNK9 amplifications showed a trend towards association 
with poor survival (p = 0.07). 

Discussion 

Around 50% of OSCC patients succumb to this disease within 5 years 
of diagnosis despite the advancements in treatments 18 . The interest 
therefore is to decrease the mortality rate of patients which can be 
done through early detection of OSCCs. Even before cells in a tumour 
lesion exhibit dysplasia histologically, genetic alterations of onco- 
genes and tumour suppressor genes may have already been found 
in the lesion. The alterations in these genes could be used as markers 
for early screening 19,20 . 

Gains and losses in chromosome 8 have been reported in various 
types of cancers including breast cancers 11 , colorectal cancers 21 , pro- 
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Figure 2 | Percentage of amplifications (green), deletions (red) and high-level amplifications (blue) for 30 genes in chromosome 8 of 33 OSCC patients. 
The black dashed line shows the position of the centromere. 



state cancers 22 and gastric cancers 23 . Many key oncogenes and 
tumour suppressor genes in the 8q and 8p arm have been found 
frequently altered in oral cancers. These genes may play a role in 
the carcinogenesis of oral cancer and can potentially be used in 
diagnosis and prognosis. In this study, we aim to detect the genes 
which are frequently altered in oral cancer and find the association 
between these genes and clinicopathological factors. 

In our study, EIF3E (8q22-q23) was the most frequently amp- 
lified. EIF3E (also known as INT6 or EIF3S6) is one of the thirteen 
subunits of the eukaryotic translation initiation factor 3 complex 
(eIF3) and is involved in the initiation of protein synthesis. EIF3E 
was first discovered as a common integration site for mouse mam- 
mary tumour virus. Inserted viral genome causes the production of 
truncated chimeric mRNA 24 , where its overexpression has attributed 
to the malignant transformation of human mammary epithelial cells 
in both in vitro 25 and in vivo 26 . However, the overexpression wild- 
type EIF3E did not have the same outcome. Both wild-type and 
truncated EIF3E can be expressed in the same cell which shows that 
truncated EIF3E acts as a dominant negative oncoprotein while the 
wild-type protein acts as a tumour suppressor 27,28 . Its involvement in 
tumourigenesis though is still unclear as some studies have proposed 
an oncogenic role 2 ' 30 for this gene while others have proposed a 
tumour suppressor role 24,27 . Our study showed that EIF3E was amp- 
lified in 27.3% of OSCCs which is consistent with its oncogenic role 
found in breast cancers 29 and glioblastoma cells™. Moreover, the 
study on breast cancer cells found no substantial expression of trun- 



cated EIF3E which points towards a role for its wild-type form in 
promoting tumourigenesis 29 . EIF3E amplifications were also signifi- 
cantly associated with negative lymph node status. Essentially all 9 
patients with amplification of this gene had a negative lymph node 
status. This implies that although EIF3E amplification might cause 
tumourigenesis, it seems to also prevent lymph node metastasis. In 
contrast, a study done on colorectal cancers showed that in stage D 
cancers, the amplification of EIF3E was significantly higher in liver 
metastasis than primary tumours. However, this study did not find a 
significant difference in the amplification of this gene between non- 
metasising and metasising tumours 31 . The amplification of EIF3E 
seems to affect tumourigenesis and metastasis of OSCC through a 
different mechanism when compared to colorectal cancers. To better 
understand the role of EIF3E in the carcinogenesis of OSCC, further 
studies have to be done on the expression of this gene and the 
involvement of the truncated EIF3E. 

In addition, amplifications of EIF3E were significantly associated 
with OSCC patients who are non -betel quid chewers. Out of 15 non- 
betel quid chewers, 7 (46.7%) out had amplification of this gene 
compared to only 2 (11.1%) out of 18 chewers who had this amp- 
lification. This result implies that EIF3E amplification has a part to 
play in the carcinogenesis of OSCC in non-betel quid chewers. 

MYC, located at 8q24.21, is a well-known oncogene that is 
involved in cell cycle progression, apoptosis and cellular transforma- 
tion. Frequent amplifications of MYC in OSCCs were found in this 
study and are consistent with other results 1213 . In a study on head and 
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Table 1 | Sociodemographic and clinicopathological parameters of EIF3E-RECQL4 co-amplifications with OSCC cases 

EIF3E-RECQL4 

Parameters N Amplified [%) Not Amplified (%) P value (Fisher's exact test) 



Sex 






Male 


13 


3 (23.1) 


Female 


20 


0 (0.0) 


Ethniritv 




Indian 


16 


0 (0.0) 


Others 


17 


3 (1 7.6) 


Alcohol Drinking 




Drinkers 


10 


3 (30.0) 


Non-drinkers 


23 


0 (0.0) 


Smoking 




Smokers 


7 


2 (28.6) 


Non-smokers 


26 


1 (3.8) 


R^tt*l Quid CfiAwina 




Chewers 


18 


0 (0.0) 


Non-ch ewe r s 


1 5 


3 (20.0) 


Habits 




Yes 


26 


3 (1 1.5) 


No 


7 


0 (0.0) 


Tumour Sub-sites 






Tongue 


18 


3 (16.7) 


Othprs (Rurrnh 


1 5 


0 (0.0) 


Tumour Size 






Tl &T2 


18 
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1 (8.3) 


Lymph Node Status 
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1 1 


0 (0.0) 


Negative 


20 


3 (15.0) 


TNM Staging 






Stage l/ll 


12 


2 (16.7) 


Stage lll/IV 


20 


1 (5.0) 


Histological Grading 






Well differentiated 


14 


1 (7.1) 


Moderate differentiated 


17 


2 (1 1.8) 



neck squamous cell carcinomas (HNSCC), amplifications of MYC 
has been associated with advanced tumours and late tumourigen- 
esis 32 . MYC gene amplifications have also been associated with the 
progression of HNSCCs among Indians 33 . Our results showed that 
MYC amplifications could not predict survival which was similar to 
the results in other studies 33 . In our study, we found a trend associat- 
ing MYC with negative lymph node status but we did not find any 
associations with other clinicopathological factors. 

Frequent amplifications of RECQL4 were found in our study. 
REQL4, located at 8q24.3, belongs to the RecQ family of DNA heli- 
cases and functions to maintain genome integrity. Deficiencies of 
RecQ helicases have been linked to cancer predisposition and pre- 
mature aging disorders 34 . On the other hand, the amplification and 
overexpression of RECQL4 has been reported in colorectal 31 , 
breast 35 , laryngeal 36 and cervical cancers 37 which is consistent with 
what we found in this study. In a study on breast cancers, RECQL4 
has been found to be a potential metastasis promoting gene 35 . 
RECQL4 overexpression has been associated with late stage laryngeal 
squamous cell carcinomas 36 . In our study, we did not find associa- 
tions between the amplifications and late stage OSCCs which indi- 
cate that the gene expression of RECQL4 has not been affected in our 
samples even though it showed significant amplifications. 

MYBL1, a strong transcriptional activator, was found to be amp- 
lified in 4 out of 33 patients. This gene which is also known as A-Myb, 
is located at 8q22 and is mainly expressed in spermatocytes, neuronal 
cells and B-lymphoid cell. MYBL1 plays a role in the proliferation 
and differentiation of these cells. The role of this gene in cancer 
progression is less clear as it is not commonly found amplified in 
cancers. We also found a study which showed that A-Myb overex- 
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pression promotes cell proliferation in the benign tumours of the 
smooth muscle wall of the uterus (uterine leiomyoma) 38 . 

The most frequently deleted gene in our study was the GATA- 
binding protein 4 (GATA4) which belongs to the GATA family of 
zinc-finger transcription factors. This gene is located at the chromo- 
somal region 8p23.1-p22 which has been shown to be among the 
frequent targets for deletion in colorectal cancers in concordance 
with our results 3 '. The loss of GATA4 mRNA expression however, 
is attributed mostly to promoter hypermethylation as shown in vari- 
ous cancers 4 " 42 . This shows that GATA4 is a potential tumour sup- 
pressor 43 . In contrast, GATA4 was also found to be amplified and 
overexpressed in oesophageal adenocarcinomas, which suggests that 
this gene may have other roles in tumourigenesis 44 . In addition to the 
epigenetic factors, the deletion of this gene may have an effect its gene 
expression as shown by the frequent deletions found among OSCCs 
in our study 45 . 

The fibroblast growth factor receptor 1 (FGFR1) gene, located on 
8pl2, acts as the cell surface receptor for fibroblast growth factors 
and is involved in the regulation of embryonic development, cell 
proliferation, differentiation and migration. FGFR1 belongs to a 
large complex family of signalling molecules that are known to be 
involved in cancer progression 46 . In different cancers, FGFR1 ampli- 
fications have been frequently described. Amplification of FGFR1 
has been shown in 10% of breast cancers 47 , about 5% of ovarian 
cancers 48 and 3% rhabdomyosarcomas 4 '. A number of studies have 
also reported deletions of FGFR1 in addition to amplifications in 
bladder cancers 50 , lung cancers 51 and breast cancers 11 . Our study 
showed a relatively high percentage of FGFR1 deletion (24.2%) in 
OSCCs which is in contrast with the study done by Freier et al. 12 
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Table 2 | Sociodemographic and clinicopathological parameters of EIF3E amplifications with OSCC cases 

EIF3E 



Parameters N Amplified (%) Not Amplified (%) P value (Fisher's exact test) 



Sex 






Male 


13 


6 (46.2) 


Female 


20 


3 (15.0) 


Ethniritv 
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1 6 


2 (1 2.5) 
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V — ' II 1 CM o 
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7 (41 .2) 
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Smokers 
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4 (57.1) 
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20 


9 (45.0) 


TNM Staging 






Stage l/ll 


12 


5 (41.7) 


Stage lll/IV 


20 


4 (20.0) 


Histological Grading 






Well differentiated 


14 


4 (28.6) 


Moderate differentiated 


17 


5 (29.4) 



Although these FGFR1 copy number changes may not influence its 
gene expression, this difference in results indicates that FGFR1 could 
have both oncogenic and tumour protective functions as shown in a 
related receptor, FGFR2 52 . 

High level amplifications were rare among the 30 genes on chro- 
mosome 8 that we analysed in OSCCs. The most frequent was MYC 
(6.1%) with 2 cases out of 33 showing high-level amplifications which 
result is similar to the percentage of high level MYC found in breast 
cancers (8.0%) n . High level amplifications of another gene, FGFR1 
was also found in OSCCs 12 . In our study, high level amplification of 
FGFR1 was not only absent, we found that this gene was commonly 
deleted among our samples. 

A gene in our study, EXT1 (8q24.1 1) has been found to be gained 
in betel quid associated OSCC 13 . However, there were no associations 
for this gene among betel quid chewers from our study. 

The amplifications of two separate genes in a tumour may interact 
to affect tumour development. However, some genes may just be 
simply passengers which are co-amplified with other genes which 
drive tumour development. We identified three most frequent co- 
amplifications between EIF3E-MYC (15%), EIF3E-MYBL1 (12%) 
and EIF3E-RECQL4 (9%). The only association found was between 
the co -amplification of EIF3E-RECQL4 and alcohol drinkers. The 
co-amplification of EIF3E-RECQL4 was found in 3 OSCC patients 
who were all alcohol drinkers. This suggests that the carcinogenesis 
in alcohol drinkers may involve the co- amplification of these two 
genes. 

Overall, the gains and losses we found at chromosome 8 in our 
OSCC samples were similar with those found in other cancers. For 
various cancers, gene losses were frequently found in the short arm of 
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chromosome arm 8p while most gene amplifications were found in 
8q 53,54 . The same pattern of gains and losses was also found in 
OSCCs 7,810,55 . Our results were consistent with these studies. There 
were some notable difference in the level of gains and losses of some 
genes when compared to other studies on OSCCs which could be due 
to the difference in ethnicity or aetiology 2,56 . 

The heatmap (Figure 1) highlights a small part of the genetic 
heterogeneity of chromosome 8 in OSCCs. Out of the 30 genes we 
studied in chromosome 8, the number of alterations of each patient 
ranged from as high as 9 gene alterations to no alterations. It has long 
been known that cancer genomes are highly complex and unstable 6 . 
Heterogeneity within a single cancer type is known as inter-tumour 
heterogeneity. Recent studies have also shown genetic heterogeneity 
within a tumour itself 7 . It is known as intra-tumour heterogeneity 
and could occur due to both genetic and non-genetic factors 58 . As a 
result of this heterogeneity, small tissue sample obtained may not be 
representative of the whole tumour. Tumour heterogeneity may have 
a significant impact on the effectiveness of biomarkers used in dia- 
gnosis, prognosis and treatment of cancers. 

Our data showed that both positive lymph node status and mod- 
erately-differentiated graded tumours were associated with poor 
overall survival. Lymph node status is shown to be an important 
prognostic factor for oral cancers 5 '. Conversely, histological grading 
is known to have poor prognostic impact and poor reproducibility 
due to the heterogeneity of tumours 59 which was in contrast to our 
results. 

The KCNK9 gene amplifications showed a trend toward poor 
prognosis in our results. This gene has been shown to be amplified 
and overexpressed in some cancers including breast cancers 60 . 
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Table 3 Sociodemographic and clinicopathological parameters of MYC amplifications with OSCC cases 
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However, out of the 33 patients in our study, only one patient had this 
amplification while another patient had a deletion. There were no 
prognostic significance on survival for the other genes in our study. 

In conclusion, we were able to simultaneously detect gains and 
losses of genes in chromosome 8 in OSCCs using the MLPA tech- 
nique. The genes found significantly amplified in were EIF3E, MYC, 
RECQL4 and MYBL1 while the genes significantly deleted were 
GATA4, FGFR1, MSRA and CSGALNACT1. The copy number gain 
of EIF3E was the most frequently found and was associated with non- 
betel quid chewers, indicating it has both an oncogenic and meta- 
static preventing role among non-betel quid chewers. Furthermore, 
EIF3E was also associated with the negative lymph node status, indi- 
cating its potential use as a biomarker for patients with lower risk of 
lymph node metastasis. However, this gene did not show prognostic 
value in predicting patient survival. The co-amplification of EIF3E- 
RECQL4 was found to be associated with alcohol drinking which 
suggest a role for this co- amplification in alcohol induced carcino- 
genesis. Our findings showed that genetic alterations could deter- 
mine clinical characteristics. The alterations found in our study 
would assist in the development of markers for the early detection 
and prognosis of OSCC. 

Methods 

Tissue samples and DNA isolation. Fresh frozen tissues from 33 cases of oral 
squamous cell carcinoma (OSCC) were included in this study. Ten fresh frozen 
tissues of gingiva flaps taken from healthy normal non-cancer individuals during the 
minor impacted third permanent molar surgery were used as controls. All samples 
and related sociodemographic and clinical data were obtained from the Malaysian 
Oral Cancer Data and Tissue Bank System (MOCDTBS). The complete 
sociodemographic and clinicopathological details of all cases are shown in Table 4. 
Overall survival data was also collected. The tissues collected were snap frozen in 



liquid nitrogen { — 100°C to — 196°C) and were sectioned using a cryostat-microtome. 
A tissue section from each case was stained with haematoxylin and eosin (H&E) for 
histological assessment by oral pathologists. Only samples with more than 70% 
tumour cell content were further sectioned for DNA isolation. DNA was extracted 
from 750 um of tissue sections using the DNEasy Blood & Tissue Kit (Qiagen GmBH, 
Hilden, Germany) according to the manufacturer's protocol. The concentration and 
quality of the DNA were measured using the Nanodrop ND-2000 spectrophotometer. 

Multiplex ligation- dependant probe amplification. The P014-A1 MLPA probe-set 
(MRC-Holland, Amsterdam, The Netherlands) which specifically detects genes in 
chromosome 8 was used. The kit contains 32 probes which covers 30 genes on 
chromosome 8 and includes an additional 9 control probes from other chromosomes. 
The full list of genes can be found as Supplementary Table Si online. All MLPA 
experiments were performed based on the supplier's protocol 17 and in duplicates. In 
each reaction, 50 ng of DNA in a volume of 5 ul was denatured at 98°C for 5 minutes. 
Then, the DNA was cooled to 25°C and a mixture of 1.5 ill MLPA buffer and 1.5 (tl 
probe-mix was added. The mixture was then incubated at 95°C for 1 minute and then 
for 16 hours at 60 °C for the hybridization of the probes. Next, 32 ul ligation mix was 
added to each reaction and then incubated at 54°C followed by 5 minutes at 98°C to 
deactivate the Ligase-65 enzyme. PCR was performed on 1 0 ul of the ligation product 
with FAM-labelled primers. The amplified fragments were analysed using a DNA 
sequencer. 

Data analysis. The relative copy number ratio for each gene in every sample was 
calculated by dividing the normalized mean peak areas of each gene in a sample tissue 
with the normalized mean peak areas of the same gene in a tissue from non-cancer 
patients. Copy number ratios of below 0.7 were considered as loss while copy number 
ratios of 1.3 and above were considered as a gain. The data were interpreted using the 
Coffalyser software (MRC-Holland) and Microsoft Excel. 

Statistical analysis. Statistical analysis was conducted using an Excel spreadsheet and 
IBM SPSS version 20. Significant amplifications and deletions were identified using 
Fisher's exact test. Copy number associations with sociodemographic and 
clinicopathological parameters were calculated using Chi-square or Fisher's exact 
test, whenever appropriate. Survival analysis was conducted using the Kaplan Meier 
method and log-rank test. P-values < 0.05 were considered to be significant. The data 



SCIENTIFIC REPORTS | 4:6073 | DOI: 1 0. 1 038/srep06073 



6 



0) 

o 



» I » » 

O O O O 



"5 o 



.2 .8 .ffi 

ODD 



-ffi .2 .8 

ODD 



.2 .2 

D D 



.2 .2 

D D 



1 



CD CD CD CD (1) 0 

-D-D-D-D = = -D-D 

OOOO^^OO 



0 0 0 0 0 0 0 0 0 0 



z 

Q_ 



> > > 




> = > > 




> > > 


> 




> > > — 








— > > 








CO 




u u 






CO CO CO 












O O 0 


o 


O — O CN 


o 


CN O CN 


o 


0 0 o o 


CN CN CN 0 i— 


X 


o o — 


o 


— o 


o 


Z Z Z 


Z 


, z z z z 


z 


Z Z Z 


Z 


Z Z Z Z 


z z z z z 


z 


z z z 


Z 


z z z 


z 



■— 



CT> — XCNCN^t^f 



^f^^^CNn^^CN^CN^CNCNCN^^f^fCO 



D D D D 

u u u u 
u cj cj u 

D ZD ZD ZD 
_Q _Q _Q _Q 



o 



CD O) CD O) CD 



CD CD CD CD 

£ S J s 

i_ i_ CD CD CD CD 

CD CD -£= -EZ -£Z -EZ 

J J V V V V 

e i o § § S 



V) </) (O t/) 



O i! O O O O 



CD CD CD CD 



O O O O 



O O 
u u 
u u 



D) O) O) HI O) O) O) CD O3O50DCD CD CD 



CD CD CD 

S S £ 

CD CD CD 



CD CD 



CD 0 - 

S St 

CD CD 



CD CD CD CD - 

s s s s 

CD CD 0 CD 



CD CD - 1 — 

S S V 

CD CD § 



uuzzzzuzzzuuzuzuzuzuuzuzuuuuzuuzu 



u 
u 
to 

o 

[~ — — —X. _SC —X. _X _3C 

CO -r- ccccccc 

ro ~~o - c *c 
= ~9~9~9~9~9~9~9. 



(D ID 0 <D 111 ID ID 



!-o-o 



(D CD (D CD CD 



000000000 



~9~9~9~9~9~9~9~9~9. 



0 CD 0 CD CD 



! "9 "9 "9 "9 "9 



o 

Q_ 



_2 

o 



o 

Q_ 

o 



c 
o 



Q_ 

O 

S 5 



0 
g 
o 

CO 



_CU 
D 



oooooooc-ccoo-cccccooooooooo-ccooooo 
ZZZZZZZoooZZoooooZZZZZZZZZooZZZZZ 



o o o 
III 



CD 


CD 


CD 












O 


O 


O 




E 


E 


E 


CD 


to 


to 


to 


cz 


EZ 


EZ 


O 


O 


O 


O 


Sm 


Z 


Z 


z 



5 ° 
-9 E 



CD 




CD CD 
_^ 




CD CD 






CD CD CD 


CD 
^c 


CD CD 
_^ 


0 0 0 0 0 




CD 


CD 


CD CD 


jOUIS-l. 


CD 


O 0 

E E 

CO to 
EZ EZ 


CD 


0 O 

E E 

CO CO 
EZ EZ 


CD 


CD 


jOLUS-l 


O 

E 

tO 

cz 


O 0 
E E 

to to 
c c: 


O 0 O O 0 
E E E E E 

CO CO CO to CO 
C EZ C C EZ 


CD 


o 
E 

to 
EZ 


o 
E 

to 
CZ 


O 0 
E E 

to to 
tz C 


Nor 


o 
E 
CO 


O O 

Z Z 


O 

E 

CO 


O o 
Z Z 


o 
E 

CO 


o 
E 

CO 


O O O 

z z z 


O 
Z 


O O 

z z 


O O O O O 

z z z z z 


o 
£ 

CO 


o 

z 


o 

z 


O O 

z z 



CD CD CD 
to to to 
CD CD CD 



gg^c^EzOEZEzEggEzgcg 

.9 _EZ DiTjiixTJTJjzTJjzTJ 

-E-ESuSU-EUUU-5-EU-EU-E 



0 

cu>^>^czcz>^cz>vczczczcz>vczcz>^>^ 
c_o_o g g_o g_o g g g g _g g goo 

IE O Dtjtj Dt} CTjijTJTJ 9 "O ~D J3 O 

uSS-E-eS-eS-E-E-E-eS-E-eSS 



_cd_cd _cd_cd_cd jd jd _cd _cd _2 cd _cd _cd_cd_cd_cd_cd_cd _cd_cd 

CDOOcDOOOcDCDCDOOOOcDCDCDOcDOOcDOOOOOOcDOOcDCD 

"5 E E"5 E E E-5-5-D E E E E "5 ~o ~5 E "5 E E "0 E E E E E E "6 E E "o "5 

-<CDCD-rCDCDCD-<-<-<<DCDCDCD-<-<-<CD-<CDCD-<CDCDCDCDCDCD-<CDCD-<-< 
^U_U_^Ll_U_U_^^^LJ_Ll_Li_Ll_^^^Li_^Li_U_^U_Li_Ll_Li_Ll_Ll_-CLl_Ll_-C-^ 



□ O — CNCO-^-cOOKoOOO — CNCO-^-co<3KcjOOO — CNCO 

(J ^C>lcro^cO<lKcXlC>^^^^^^^^^^CNCNCNCNCNCSCNCNCSCNcrocOCOCO 



: REPORTS I 4 : 6073 | 



DOI: 10.1038/srep06073 



7 



were grouped as follows: amplified vs non- amplified, male vs female, Indians vs 
others (Malays and Chinese), tobacco smokers vs non-smokers, alcohol drinkers vs 
non-drinkers, betel quid chewers vs non-chewers, tongue vs others (buccal), primary 
tumour size (T1/T2 vs T3/T4), lymph node (LN+ vs LN— ), tumour stage (Stage I/II 
vs Stage III/IV), and histological grading (well vs moderate). 
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